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Abstract. We give theorems that can be used to upper bound the densities 
of packings of different spherical caps in the unit sphere and of translates of 
different convex bodies in Euclidean space. These theorems extend the lin- 
ear programming bounds for packings of spherical caps and of convex bodies 
through the use of semidefinite programming. We perform explicit compu- 
tations, obtaining new bounds for packings of spherical caps of two different 
sizes and for binary sphere packings. We also slightly improve bounds for the 
classical problem of packing identical spheres. 



1. Introduction 

How densely can one pack given objects into a given container? Problems of 
this sort, generally called packing problems, are fundamental problems in geometric 
optimization. 

An important example having a rich history is the sphere packing problem. 
Here one tries to place equal-sized spheres with pairwise disjoint interiors into 
rt-dimensional Euclidean space while maximizing the fraction of covered space. In 
two dimensions the best packing is given by placing open disks centered at the 
points of the hexagonal lattice. In three dimensions, the statement that the best 
sphere packing has density n/Vl8 = 0.7404. . . was known as Kepler's conjecture; 
it was proved by Hales [20] in 1998 by means of a computer-assisted proof. 

Currently, one of the best methods for obtaining upper bounds for the density of 
sphere packings is due to Cohn and Elkies [8] . In 2003 they used linear programming 
to obtain the best known upper bounds for the densities of sphere packings in 
dimensions 4, . . . , 36. They almost closed the gap between lower and upper bounds 
in dimensions 8 and 24. Their method is the noncompact version of the linear 
programming method of Delsarte, Goethals, and Seidel [llj for upper-bounding 
the densities of packings of spherical caps on the unit sphere. 

From a physical point of view, packings of spheres of different sizes arc relevant 
as they can be used to model chemical mixtures which consist of multiple atoms 
or, more generally, to model the structure of composite material. For more about 
technological applications of these kind of systems of polydisperse, totally impen- 
etrable spheres we refer to Torquato [391 Chapter 6]. In recent work, Hopkins, 
Jiao, Stillinger, and Torquato [UJ presented lower bounds for the densities of 
packings of spheres of two different sizes, also called binary sphere packings. 
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In coding theory, packings of spheres of different sizes are important in the design 
of error-correcting codes which can be used for unequal error protection. Masnick 
and Wolf [3T] were the first who considered codes with this property. 

In this paper we extend the linear programming method of Cohn and Elkies to 
obtain new upper bounds for the densities of multiple-size sphere packings. We also 
extend the linear programming method of Delsarte, Goethals, and Seidel to obtain 
new upper bounds for the densities of multiple-size spherical cap packings. 

We perform explicit calculations for binary packings in both cases using semidef- 
inite, instead of linear, programming. In particular we complement the constructive 
lower bounds of Hopkins, Jiao, Stillinger, and Torquato by non-constructive upper 
bounds. Insights gained from our computational approach are then used to improve 
known upper bounds for the densities of monodisperse sphere packings in dimen- 
sions 4, . . . 9, except 8. The bounds we present improve on the best-known bounds 
due to Cohn and Elkies |S]. 

1.1. Methods and theorems. We model the packing problems using tools from 
combinatorial optimization. All possible positions of the objects which we can use 
for the packing are vertices of a graph and we draw edges between two vertices 
whenever the two corresponding objects cannot be simultaneously present in the 
packing because they overlap in their interiors. Now every independent set in this 
conflict graph gives a valid packing and vice versa. To determine the density of the 
packing we use vertex weights since we want to distinguish between "small" and 
"big" objects. For finite graphs it is known that the weighted independence number 
can be upper bounded by the weighted theta number. Our theorems for packings 
of spherical caps and spheres are infinite-dimensional analogues of this result. 

Let G = (V, E) be a finite graph. A set I C V is independent if no two vertices 
in / are adjacent. Given a weight function w: V — > M>o, the weighted independence 
number of G is the maximum weight of an independent set, i.e., 

a w (G) — max< > w(x) : I C V is independent >. 



Finding a w {G) is an NP-hard problem. 

Grotschel, Lovasz, and Schrijver [TH] defined a graph parameter that gives an 
upper bound for a w and which can be computed efficiently by semidefinite opti- 
mization. It can be presented in many different, yet equivalent ways, but the one 
convenient for us is 

§' w (G)=mhx M 

K — (w 1 / 2 )(k; 1 / 2 ) t is positive semidefinite, 
K(x, x) < M for all x e V, 

K(x, y) < for all {x, y} £ E where x ^ y, 

M eR, K eR VxV is symmetric. 

Here we give a proof of the fact that "&' W (G) upper bounds a w (G). In a sense, 
after discarding the analytical arguments in the proofs of Theorems 1 1 . 2| and 1.3 we 
are left with this simple proof. 

Theorem 1.1. For any finite graph G — (V, E) with weight function w: V K>o 
we have a w (G) < <d' w (G). 

Proof. Let / C V be an independent set of nonzero weight and let K £ R VxV , 
M e R be a feasible solution of $^(G). Consider the sum 

£ wixf'Mvf^Kix^). 

x,y£l 
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This sum is at least 

x 7 yGl x£l 

because K — (w 1 / 2 )(w 1 ^ 2 ) T is positive semidefinitc. 
The sum is also at most 

^2 w(x)K(x, x) < M w ( x ) 

x£l x£l 

because K(x,x) < M and because K(x,y) < whenever x ^ y as I forms an 
independent set. Now combining both inequalities proves the theorem. □ 

Multiple-size spherical cap packings. We first consider packings of spherical 
caps of several radii on the unit sphere S"™ -1 — {x £ R™ : x ■ x — 1 }. The spherical 
cap with angle a £ [0, it] and center x £ S 11 ^ 1 is given by 

C(x, a) — {y £ S 71 ^ 1 : x ■ y > cos a }. 

Its normalized volume equals 

/ \ ^n-liS™ 2 ) f 1 , 2\(n-3)/2 > 

w{a) = ^s^rLJ l - u) du > 

where u} n (S n ^ 1 ) = (2n n ^ 2 )/T(n/2) is the surface area of the unit sphere. Two 
spherical caps C{x\, a\) and C{x2, ot^) intersect in their topological interiors if and 
only if the inner product of X\ and X2 lies in the interval (cos(ai+a2), 1]. Conversely 
we have 

C(xi,ai)° nC(x 2 ,a 2 )° =® x% ■ x 2 < cos(ai + a 2 ). 

A packing of spherical caps with angles ati, . . . , a at is a union of any number of 
spherical caps with these angles and pairwise-disjoint interiors. The density of the 
packing is the sum of the normalized volumes of the constituting spherical caps. 

The optimal packing density is given by the weighted independence number 
of the spherical cap packing graph. This is the graph with vertex set S 1 ™ -1 x 
{1, ...,N}, where a vertex (x,i) has weight w(pti), and where two distinct ver- 
tices (x,i) and (y, j) are adjacent if cos(a^ + a,-) < x ■ y. 

In Section [2] we will extend the weighted theta prime number to the spherical 



cap packing graph. There we will also derive Theorem 1.2 below, which gives 
upper bounds for the densities of packings of spherical caps. We will show that the 
sharpest bound given by this theorem is in fact equal to the theta prime number. 

In what follows we denote by P£ the Jacobi polynomial p^™ _3 )/ 2 '(" -3 )/ 2 ) f 
degree k, normalized so that P^(l) = 1. 

Theorem 1.2. Let cc\, ■ ■ ■ , ckjv £ (0, 7r] be angles and for i, j — 1, . . . , N and k > 

let fij t k be real numbers such that fi^h — fji,k an d X)fc°=o \fijM < 00 f or a ^ h 3- 
Write 



(1) /«(«) y,J\,,l'-<>n. 

fc=0 

Suppose the functions satisfy the following conditions: 

(i) {fij.o ~ w { a i) 1 ^ 2w { a j) 1 ^ 2 ) 1 ^ j = i * s positive semidefinite; 

(ii) (/y,fc)^ =1 is positive semidefinite for k > 1; 

(iii) fij{u) < whenever —l<u< 003(0;; + ctj). 

Then the density of every packing of spherical caps with angles ot\, . . . , a at on the 
unit sphere S 1 ™" 1 is at most max{ fu(l) : i = 1, . . . , N }. 
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When N = 1, Theorem 1.2 reduces to the linear programming bound for spheri- 
cal cap packings of Delsarte, Goethals, and Seidel [11]. In Section[4]we use semidef- 
inite programming instead of linear programming to perform explicit computations 
for N — 2. 



Translational packings of bodies and multiple-size sphere packings. We 

now deal with packings of spheres with several radii in R™. Theorem |1 .3| presented 
below can be used to find upper bounds for the densities of such packings. In fact, 
it is more general and can be applied to packings of translates of different convex 
bodies. 

Let K-i, . . . , K,jsr be convex bodies in W l . A translational packing of JC\, . . . , /Cjv 
is a union of translations of these bodies in which any two copies have disjoint 
interiors. The density of a packing is the fraction of space covered by it. There are 
different ways to formalize this definition, and questions appear as to whether every 
packing has a density and so on. We postpone further discussion on this matter 
until Section [3] where we give a proof of Theorem |1.3[ 

Our theorem can be seen as an analogue of the weighted theta prime number 
for the infinite graph G whose vertex set is E™ x {1, . . . , N} and in which ver- 
tices (x,i) and are adjacent if x + ICi and y + ICj have disjoint interiors. The 
weight function we consider assigns weight vol/C^ to vertex (x 7 i) € M" x {1, . . . , N}. 
We will say more about this interpretation in Section [3j 

For the statement of the theorem we need some basic facts from harmonic anal- 
ysis. Let /: R" — > C be an L 1 function. For u £ K", the Fourier transform of / 
at u is 

/(«)= / f(x)e~ 2 ™-*dx. 

We say that function / is a Schwartz function (also called a rapidly-decreasing 
function) if it is infinitely differentiable, and if any derivative of /, multiplied by any 
power of the variables X\, . . . , x n , is a bounded function. The Fourier transform 
of a Schwartz function is a Schwartz function, too. A Schwartz function can be 
recovered from its Fourier transform by means of the inversion formula: 



f(x) = / f(u)e 2 ™ x du 



for all x € 



Theorem 1.3. Let K X) . . . , JC N be convex bodies m R n and let f: R n ->• R NxN 
be a matrix-valued function whose every component fij is a Schwartz function. 
Suppose f satisfies the following conditions: 

(i) the matrix (/^(O) — (vol/Ci) 1 / 2 (vol/Cj) 1 / 2 )^._ 1 is positive semidefinite; 

(ii) the matrix of Fourier transforms (/y (m)) . is positive semidefinite for 
every u g M n \ {0}; 

(hi) fij{x) < whenever K° n (x + JC°) = 0. 

Then the density of any packing of translates of K\, /Cat in the Euclidean 
space R™ is at most max{ fu(0) : i = 1, . . . , N}. 

We give a proof of this theorem in Section [3] When N = 1 and when the convex 
body K-i is centrally symmetric (an assumption which is in fact not needed) then 
this theorem reduces to the linear programming method of Cohn and Elkies 8J. 

We apply this theorem to obtain upper bounds for the densities of binary sphere 



packings, as we discuss in Section 1.3 
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Figure 1. Upper bounds on the packing density for N — 2. The 
horizontal and vertical axes carry the spherical cap angle; the colors 
indicate the density, or in the case of plot (b) whether the SDP 
bound or the geometric bound is sharper. 



1.2. Computational results for binary spherical cap packings. We applied 
Theorem \1.2\ to compute upper bounds for the densities of binary spherical cap 
packings. The results we obtained are summarized in the plots of Figure [T] 

For n — 3, Florian |13l 114] provides a geometric upper bound for the density of 
a spherical cap packing. He shows that the density of a packing on S 2 of spherical 
caps with angles a%, . . . , ajy S (0, 7r/3] is at most 

max D(ai,aj,ak), 

l<i<j<k<N J 

where D(di, a.j, otk) is defined as follows. Let T be a spherical triangle in S 2 such 
that if we center the spherical caps with angles a.i, otj, and a.k at the vertices of T, 
then the caps intersect pairwise at their boundaries. The number Z?(ai, <x,-, is 
then defined as the fraction of the area of T covered by the caps. 

In Figure [Tb] we see that for N = 2 it depends on the angles whether the geo- 
metric or the semidefinite programming bound is sharper. In particular we see that 
near the diagonal the semidefinite programming bound is at least as good as the 



geometric bound; see also Figure 2a 
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We can construct natural multiple-size spherical cap packings by taking the 
incircles of the faces of spherical Archimedean tilings. A sequence of binary packings 
is for instance obtained by taking the incircles of the prism tilings. These are the 
Archimedean tilings with vertex figure (4, 4, k) for k > 3 (although strictly speaking 
for k — 4 this is a spherical Platonic tiling). The question then is whether the 
packing associated with the fc-prism has maximal density among all packings with 
the same cap angles n/k and tt/2 — ir/k, that is, whether the packing is maximal. 
The packing for k = 3 is not maximal while the one for k — 4 trivially is, since here 
there is only one cap size, and adding a 9th cap yields a density greater than 1. 

Heppes and Kertesz J33] showed that the configurations for k > 6 are maximal, 
and the remaining case k — 5 was later shown to maximal by Florian and Hep- 
pes [15] . Florian [13] showed that the geometric bound given above is in fact sharp 
for the cases where k > 6, and for the k = 5 case it is not sharp but still good 
enough to prove maximality (notice that given a finite number of cap angles, the 
set of obtainable densities is finite). 

Now we illustrate that Theorem |1.2| gives a sharp bound for the density of the 
packing associated to the 5-prism, thus giving a simple proof of its maximality. The 
theorem also provides a sharp bound for n — 4 but whether it can provide sharp 
bounds for the cases n > 6 we do not know at the moment. The numerical results 
are not decisive. 

We shall exhibit functions 

4 

which satisfy the conditions of Theorem 1 1 . 2 1 with /n(l) = hw(ot\) + 2w(ct2) where 



tt 3?r 1 / tt\ 1 / 3tt 

"i = g, «2 = w \ a V " 2 V ° S 5/ ' W ^ 2 ' = 2 V ° S 10 

By complementary slackness of semidefinitc optimization the coefficients have 
to satisfy the following linear conditions: 

= hi (cos ?f\ = f n (cos - f u (cos = / 12 (0) - / 22 (-l); 

the product 

Vn,o fi2,o\ ( 2hw{oix) 10 y /w(a 1 )w(a 2 y 

v /i2,o /22,o/ \10y/w(ai)w(a2) 4w(a 2 ) 

equals 

25w(q;i) 2 + 10w(ai)w(at2) y/w(ai)w(oi2)(10w(ai) + 4w(oc2))\ 

K y/w(ai)w(a2)(25w(ai) + 10w(a2)) 10w(ai)w(a2) + 4:w(a 2 ) 2 / ' 

for k = 1, . . . , 4 the product of the two matrices ( ^} 1,k ^, 12 ' k ) and 

\J12,k J22,k I 



'w( ai )(5P k (l) + 10F fe (cos f ) + 10P fc (cos f )) ^w(a 1 )w(a 2 )lOP k (0) 
^w(a 1 )w{a 2 )lOP k (0) w(a 2 )(2P k (l) + 2P fe (-l)) 

equals zero. This linear system together with the additional assumptions 

o = /„(-i) -/!>(- jl) 

has a one-dimensional space of solutions from which it is easy to select one which 
fulfills all requirements of Theorem |1.2| 

For the remaining 13 Archimedean solids in dimension n = 3 we are only able 
to show maximality of the packing associated to the truncated octahedron, the 



Upper bounds for packings of spheres of several radii 



7 



Archimedean solid with vertex figure (6, 6, 5). Its density is 0.9056 . . ., the geometric 
bound shows that the density is at most 0.9088..., and using the semidefinite 
program we get 0.9079 ... as an upper bound. The first packing with caps of angles 
arcsin(l/3) and arcsin(l/-\/3) which would be denser is obtained by taking 19 of 
the smaller caps and 4 of the bigger caps, and has density 0.9103 . . . The upper 
bounds show however that it is not possible to obtain this dense a packing, thus 
showing that the truncated octahedron packing is maximal. 
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(c) n = 5 

Figure 2. Upper bounds on the packing density for N = 1, the 
horizontal axis carries the spherical cap angle and the vertical axis 
the packing density. 

We also used our programs to plot the upper bounds for N = 1, the classical 
linear programming bound of Delsarte, Goethals, and Seidel [TT], for dimensions 
n = 3, 4, and 5 in Figure [2j To the best of our knowledge these kinds of plots were 
not made before and they seem to reveal interesting properties of the bound. For 
better orientation we show in the plots the packings where the linear programming 
bound is sharp (cf . Levenshtein [35] ; Cohn and Kumar [5] proved the much stronger 
statement that these packings provide point configurations which are universally 
optimal). The dotted line in the plot for n — 3 is the geometric bound, and since we 
know that both the geometric (cf. Florian [13) 1 and the semidefinite programming 
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Figure 3. The horizontal axis carries the ratio between the radii 
of the small and the large spheres. The vertical axis carries our 
upper bound. Our bounds for dimensions 2, . . . , 5 are shown 
together. 

bounds are sharp for the given configurations, we know that at these peaks the 
bounds meet. 

An interesting feature of the upper bound seems to be that it has some periodic 
behavior. Indeed, the numerical results suggest that for n = 3, the two bounds 
in fact meet infinitely often as the angle decreases, and that between any two of 
these meeting points the semidefinite programming bound has a similar shape. 
Although in higher dimensions we do not have a geometric bound, the semidefinite 
programming bound seems to admit the same kind of periodic behavior. 

1.3. Computational results for binary sphere packings. We applied Theo- 
rem [L3] to compute upper bounds for the densities of binary sphere packings. The 
results we obtained are summarized in the plot of Figure [3j where we show bounds 
computed for dimensions 2, . . . , 5. A detailed account of our approach is given in 
Section [5j We now quickly discuss the bounds presented in Figure [3j 
Dimension 2. Only in dimension 2 have binary sphere (i.e., circle) packings been 
studied in depth. We refer to the introduction in the paper of Heppes |2ll which 
surveys the known results about binary circle packings in the plane. 

Currently one of the best-known upper bounds for the maximum density of a 
binary circle packing is due to Florian [TJ]. Florian's bound states that a packing 
of circles in which the ratio between the radii of the smallest and largest circles is r 
has density at most 

7rr 2 + 2(1 — r 2 ) arcsin(r/(l + r)) 
2rV2r + 1 ' 
and that this bound is achieved exactly for r = 1 (i.e., for classical circle packings) 
and for r = in the limit. 

The question arises of which bound is better, our bound or Florian's bound. 
From our experiments, it seems that our bound is worse than Florian's bound, at 
least for r < 1. For instance, for r = 1/2 we obtain the upper bound 0.9174426 . . ., 
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whereas Florian's bound is 0.9158118 . . . Whether this really means that the bound 
of Theorem |1.3| is worse than Florian's bound, or just that the computational 
approach of Section [5] is too restrictive to attain his bound, we do not know. 

It is interesting to note that for r = 1, that is, for packings of circles of one size, 
our bound clearly coincides with the one of Cohn and Elkies [5] . This bound seems 
to be equal to n/^/l2, but no proof of this is known. 

Dimension 3. Much less is known in dimension 3. In fact we do not know about 
other attempts to find upper bounds for the densities of binary sphere packings in 
dimensions 3 and higher. 

Let us compare our upper bound with the lower bound by Hopkins, Jiao, Still- 
inger, and Torquato [25] ■ The record holder for r > 0.2 in terms of highest density 
occurs for r = 0.224744 . . . and its density is 0.824539 . . . Our computations show 
that there cannot be a packing with this r having density more than 0.8617125..., 
so this leaves a margin of 5%. 

Another interesting case is r = v2 — 1 = 0.414. . . Here the best-known lower 
bound of 0.793. . . comes from the NaCl-alloy. The large spheres are centered at 
a face centered cubic lattice and the small spheres are centered at a translated 
copy of the face centered cubic lattice so that they form a jammed packing. Our 
upper bound for r = y2 — 1 is 0.813 . . ., less than 3% away from the lower bound. 
Therefore, we believe that proving optimality of the NaCl-alloy might be doable. 
Dimension 4 and beyond. In higher dimensions even less is known about binary 
sphere packings. We observed from Figure [3] that it seems that the upper bound 
is decreasing: as the radius of the small sphere increases from 0.2 to 1, the bound 



seems to decrease. This suggests that the bound given by Theorem 1.3 is decreasing 
in this sense, but we do not know a proof of this. 

We also do not know the limit behavior of our bound when r approaches 0. Due 
to numerical instabilities we could not perform numerical calculations in this regime 
of r. 

1.4. Improving the Cohn-Elkies bounds. We now present a theorem that can 
be used to find better upper bounds for the densities of monodisperse sphere pack- 
ings than those provided by Cohn and Elkies [5] ; our theorem is a strengthening of 
theirs. 

Fix e > 0. Given a packing of spheres of radius 1/2, we consider its s-tangency 
graph, a graph whose vertices are the spheres in the packing, and in which two 
vertices are adjacent if the distance between the centers of the respective spheres 
lies in the interval [1, 1 + e). 

Let M{e) be the least upper bound on the average degree of the e-tangency 
graph of any sphere packing. Our theorem is the following: 

Theorem 1.4. Take = eq < e± < • •• < e m and let f:W l ^>-Rbea Schwartz 
function such that 

(i) /(0) > voli?, where B is the ball of radius 1/2; 

(ii) f (u) > for all u G W 1 \ {0}; 

(iii) f{x) < whenever \\x\\ > 1 + e m ; 

(iv) f(x) < rjk whenever \\x\\ € [1 +Su-i, 1 + £fc) with r)k > 0, for k — 1, . . . , m. 
Then the density of a sphere packing is at most the optimal value of the following 
linear programming problem in variables A\, . . . , A m : 

max f(Q) + r\ x A x H h r] m A m 

(2) Ai + --- + A k < U{e k ) for k = 1, m, 

Ai > for i = 1, . . . , m, 

where U(ek) > M(ek) for k — 1, . . . , m. 
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In Section [6] we give a proof of Theorem 1.4 and show how to compute upper 
bounds for M(e) using the semidefinite programming bounds of Bachoc and Val- 
lentin |3j for the sizes of spherical codes. There we also show how to use semidefinite 
programming and the same ideas we employ in the computations for binary sphere 
packings (cf. Section [5]) to compute better upper bounds for the densities of sphere 
packings. 

In Table [l] we show the upper bounds obtained through our application of Theo- 
rem [T]4] To better compare our bounds with those of Cohn and Elkies, on Table [j] 
we show bounds for the center density of a packing, the center density of a packing 
of unit spheres being equal to A/volS, where A is the density of the packing, 
and B is a unit ball. 

We omit dimension 8 because for this dimension it is already believed that the 
Cohn-Elkies bound is itself optimal, and therefore as is to be expected we did not 
manage to obtain any improvement over their bound. We also note that the bounds 
by Cohn and Elkies are the best known upper bounds in all other dimensions shown. 



Dimension Lower bound Cohn-Elkies bound New upper bound 



0.12500 
0.08839 
0.07217 
0.06250 
0.04419 



0.13126 
0.09975 
0.08084 
0.06933 
0.05900 



0.130587 
0.099408 
0.080618 
0.069193 
0.058951 



Table 1. For each dimension we show the best lower bound 
known, the bound by Cohn and Elkies [8], and the upper bound 



coming from Theorem 1.4 



In dimension 3 the Cohn-Elkies bound is 0.18616 whereas the optimal sphere 
packing has center density 0.17678. We can improve the Cohn-Elkies bound to 
0.184559 which is also better than the upper bound 0.1847 due to Rogers [33]. 

2. Multiple-size spherical cap packings 

In this section we prove Theorem |1.2| and discuss its relation to an extension of 
the weighted theta prime number for the spherical cap packing graph. 

2.1. Proof of Theorem [l]2j Let x x , . . . , x m £ S" 1 " 1 and r: {l,...,m} 
{1, . . . , N} be such that 

m 
i=l 

is a packing of spherical caps on S n . 
Consider the sum 

m 

(3) ^ w ( a r(i)) 1/2 w(a r{j )) 1/2 f r (i )r ( :j ){x l ■ Xj). 

By expanding f r {i)r(j){ x i ' x j) according to ([!]) this sum is equal to 

00 m 

X! W ( a r^)) 1/2w ( a rU)) 1/ ' 2 fr( i )rij),kPk{x i ' Xj). 
k=0i,j—l 

By the addition formula (cf. e.g. Section 9.6 of Andrews, Askey, and Roy [jQ) for 
the Jacobi polynomials PJ} the matrix (P£(xi ■ Xj)^ _ x is positive semidefinite. 
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From condition (ii) of the theorem, we also know that the matrix (fr(i)r(j),k)™j _i 
is positive semidehnite for k > 1. So the inner sum above is nonnegative for k > 1. 
If we then consider only the summand for k — we see that ([3| is at least 

m / m \ 2 

(4) ^2 w ( a r(t)) 1/2 w(a rU) ) 1/2 f r(l)r(j)fi P^ (Xi ■ xj) > ( ^2w{a t ) J , 

i,j=l M=l ' 

where the inequality follows from condition (i) of the theorem. 

Now, notice that whenever i ^ j, the caps C(xi,a r ^) and C(xj,a r (j\) have 
disjoint interiors. Condition (iii) then implies that f r (i) r (j){ x i ' x j) 0. So we see 
that ^ is at most 



(5) y,w(a»)/r(i)r(i)(l) < max{ : t = 1, AT } ^ u>(aj). 

i=l i=l 

So Q is at least Q and at most yielding 

m 

5^10(0,) <max{/ ii (l):t = l,...,JV}. □ 



2.2. Theorem 1.2 and the Lovasz theta number. We now briefly discuss a 



generalization of $' w to infinite graphs and its relation to the bound of Theorem 1.2 
Similar ideas were developed by Bachoc, Nebe, Oliveira, and Vallentin [3]. 

Let G = (V, E) be a graph, where V is a compact space, and let w : V — > M>o 
be a continuous weight function. An element in the space C(V x V) of real- valued 
continuous functions over V x V is called a kernel. A kernel K is symmetric if 
K{x,y) — K(y,x) for all x,y £ V. It is positive if it is symmetric and if for 
any N £ N and for any x\, . . . , a;^ € V, the matrix (K(xi, ■ _ 1 is positive 
semidehnite. The weighted theta prime number of G is defined as 
= inf M 

AT — w 1 / 2 ® (w 1 / 2 )* is a positive kernel, 
(6) K(x,x)<M forallxey, 

K{x, y) < for all {x, y} ^ E where x ^ y, 

M eR, K e C(V x V) is symmetric. 

One may show, mimicking the proof of Theorem |1.1[ that $' W (G) > a w (G). 

Let G = (V, E) be the spherical cap packing graph as defined in Section 
We will use the symmetry of this graph to show that ^ gives the sharpest bound 
obtainable by Theorem |1.2| 

The orthogonal group 0(n) acts on and this defines the action of 0(n) 

on the vertex set V = S^ 1 x {1, . . . , N} by A(x, i) = (Ax, i) for A e 0(n). The 
group average of a kernel K £ C(V x V) is given by 



K((x,i),(y,j))= / K(A(x,i) 1 A(y,j))dv(A), 

JO(n) 

where /i is the Haar measure on 0(n) normalized so that /i(0(n)) = 1. If (AT, M) is 
feasible for ([6]), then (AT, M) is feasible too. This follows since for each A € 0(n), 
a point (x,i) has the same weight as A(x,i), and two points (x,i) and (y,j) are 
adjacent if and only if A(x,i) and A(y, j) are adjacent. Since (K, M) and (AT, M) 
have the same objective value M, and since AT is invariant under the action of 0(n), 
we may restrict to 0(n)-invariant kernels (i.e., kernels K such that K(Au, Av) = 
K(u,v) for all A £ 0(n) and u, v G V) in finding the infimum of jSJ. 

Schoenberg [33] showed that a symmetric kernel K E C(S n ^ 1 x S*™ -1 ) is positive 
and 0(n)-invariant if and only if it lies in the cone spanned by the kernels (x, y) i— > 
PJ!{x ■ y). We will use the following generalization for kernels over V x V. 
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Theorem 2.1. A symmetric kernel K e C(V x V), with V = S n 1 x {1, . . . ,N}, 
is positive and 0(n) -invariant if and only if 

(7) K((x,i),(y,j)) = f ij (x-y) 

with 

oo 

/«(«) = £/«,*i?(«)> 

fc=0 

where ._ 1 is positive semidefinite for all k > and J2T=o \fij,k\ < 00 f or 

all i, j = 1, . . . , N, implying in particular that we have uniform convergence above. 

Before we prove the theorem we apply it to simplify problem ([6]). If if is an 
0(n)-invariant feasible solution of (|6j), then K — w 1 ^ 2 ® (w 1 ^ 2 )* is a positive O(n)- 
invariant kernel, and hence can be written in the form ([7]). Using in addition that 
Pq™ = 1, problem Q reduces to 

d' w {G) = inf M 

/ii (0) + iy(ct!f) < M for aU 1 < i < N, 

fij(u) + (w(ai)w(aj)) 1 ^ 2 < when — 1 < u < cos(ai + a.j), 

M £ K and {fij,k) i ^ x positive semidefinite for all k > 0. 

By substituting /^.o — (w(cti)w(aj)) 1 / 2 for /y^ we see that the solution to this 
problem indeed equals the sharpest bound given by Theorem |1.2| 



Proof of Theorem \2.1\ If we endow the space C(S n r ) of real- valued continuous 
function on the unit sphere S n ~ l with the usual L 2 inner product, then for /, 

9 e C(V), 

N 

(/>#)= f(x,i)g(x,i)dw(x) 



i=i 



gives an inner product on C(V). The space C(S n ) decomposes orthogonally as 

oo 

C(S n - 1 ) = ($H k , 

k=0 

where is the space of homogeneous harmonic polynomials of degree k restricted 
to S"- 1 . With 

H k s = { f G C(V) : there is a 3 G H k such that /(■, j) = S lJ g{-) }, 
it follows that C(V) decomposes orthogonally as 

oo N 
fc=0 i=l 

Given the action of O(n) on V, we have the natural unitary representation on 
C(V) given by (Af)(x,i) = f{A~ l x,i) for A G O(n) and / G C(V). It follows 
that each space is 0(n)-irreducible and that two spaces Hk,i and Uy ,v are 
0(n)-equivalent if and only if k = fc'. Let 

{ e k ,i,i : k > 0, 1 < i < iV, and 1 < I < h k } 

be a complete orthonormal system of C(V) such that ek,i.i, ■ ■ • , £k,i,h k is a basis 
of iifc,i- By Bochner's characterization [5], a kernel K E C(V x V) is positive and 
0(n)-invariant if and only if 

oo N h k 

(8) K((x,i),(y,j)) = fi3,k^2ek,i',i(x,i)e k ,j',i{y,j), 

fe=0i'j'=l J=l 
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where each (fij,k) 1 ^j_ 1 is positive semidefinite and J^kLo \fijM < 00 f° r a ^ h 3- 

By the addition formula (cf. Chapter 9.6 of Andrews, Askey, and Roy pQ) we 
have 

1=1 ^nK ) 

for any orthonormal basis ei~,i, ■ ■ ■ , &k,h k °f Hk- It follows that 

^2e kt i> t i(x,i)e k ,j>,i(y,j) = 5 U '5 jr ( o n -i\ P k ( x ' v)> 

and substituting this into ^ completes the proof. □ 

Bochner's characterization for the kernel K, which we used above, usually as- 
sumes that the spaces under consideration are homogeneous, so that the decompo- 
sitions into isotypic irreducible spaces are guaranteed to be finite. This finitcness 
is then used to conclude uniform convergence. Since the action of O(n) on V is not 
transitive, we do not immediately have this guarantee. We can still use the char- 
acterization, however, since irreducible subspaces of C(V) have finite multiplicity. 

3. Translational packings of bodies and multiple-size sphere packings 



Before giving a proof of Theorem 1.3 we quickly present some technical con- 



siderations regarding density. Here we follow closely Appendix A of Cohn and 
Elkies [8]. 

Let K.\, . . . , /Cjv be convex bodies and V be a packing of translated copies of 
TC\, . . . , /Cjv, that is, V is a union of translated copies of the bodies, any two copies 
having disjoint interiors. We say that the density of V is A if for all p £ R™ we 
have 

r-s-oo vol B(p,r) 

where B(p, r) is the ball of radius r centered at p. Not every packing has a density, 
but every packing has an upper density given by 

vo\(B{p, r) n V) 
limsup sup — — r . 

r-foo P 6R™ VOl B{p,r) 

We say that a packing V is periodic if there is a lattice L C R™ that leaves V 
invariant, that is, which is such that V = x + V for all x £ L. In other words, 
a periodic packing consists of some translated copies of the bodies /Ci, . . . , fCjsr 
arranged inside the fundamental parallelotope of L, and this arrangement repeats 
itself at each copy of the fundamental parallelotope translated by vectors of the 
lattice. 

It is easy to see that a periodic packing has a density. This is particularly 
interesting for us, since in computing upper bounds for the maximum possible 
density of a packing we may restrict ourselves to periodic packings, as it is known 
(and easy to see) that the supremum of the upper densities of packings is also 
achieved by periodic packings (cf. Appendix A in Cohn and Elkies [5]). 

To provide a proof of the theorem wc need another fact from harmonic analysis, 
the Poisson summation formula. Let / : M. n — > C be a Schwartz function and L C M. n 
be a lattice. The Poisson summation formula states that, for every x £ R n , 

veL y ' ' ueL' 

where L* = { u £ W l : u ■ x £ Z for all x £ L } is the dual lattice of L and where 
vo\(M. n /L) is the volume of a fundamental domain of the lattice L. 
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Proof of Theorem 1.3 As observed above, we may restrict ourselves to periodic 
packings. Let then L C R™ be a lattice and x±, . . . , x m £ R n and r : {1, . . . , m} — > 
{1, . . . , N} be such that 

m 

■uGL i=l 

is a packing. This means that, whenever i ^ j or v ^ 0, bodies + IC r ^ and w + 
xj + JC r (j) have disjoint interiors. This packing is periodic and therefore has a 
well-defined density, which equals 



^ m 
— -. — r > VOl/C r 

ran /r\ r 



Consider the sum 

m 

(9) ^2 ^2 ( v °l^r( l )) 1/2 (vol/C r(i) ) 1/2 / I . ( j )l . (j) (w + a: J -ari). 

Applying the Poisson summation formula we may express ^ in terms of Fourier 
transform of /, obtaining 



J2 J2 (vol-ftT r ( l) ) 1/2 (volii: i . - ) ) 1 / 2 /r(i)rO)(w)e 



27riu-(xj—Xi) 



vol(R»/L) u£i ,.. =1 

where L* is the dual lattice of L. 

Since / satisfies condition (ii) of the theorem, matrix (/r(i)r(j)(^))^ is positive 
scmidefinite for every u £ R™. So the inner sum above is always nonnegative. If we 
then consider only the summand for u = 0, we see that ([9| is at least 

rn 

VoPn/Zj E( VOl ^W) 1/2 ( Vol ^( J -)) 1/2 /r(0r( J )(0) 

(10) > vol(Rn/L) .E vol/C -W vol/C K.) 

2 



- , T , ( voi/c r 



where the inequality comes from condition (i) of the theorem. 

Now, notice that whenever v ^ or % ^ j one has f r (i) r (j) {v+Xj~Xi) < 0. Indeed, 
since V is a packing, if v ^ or i ^ j then the bodies aij + /Oj) and u + + /C r (j) 
have disjoint interiors. But then also K, r u) and v + Xj — Xi + K-r(j) have disjoint 
interiors, and then from (iii) we see that f r (i)r(j)( v + x j ~ x i) _ ■ 0- 

From this observation we see immediately that ([9]) is at most 

m m 

(H) ^2vo\K. r{l) f r ( i}r(i} (0) < max{/ 4i (0) : i = 1, . . . , TV } vol/C r(i) . 

i=i i=i 

So (|9| is at least ( fl0| ) and at most (11 1. Putting it all together we get that 

vol(R"/L) H vol/C '-(») ^ max U»(°) :* = 1, JV}, 
proving the theorem. □ 



We mentioned in the beginning of the section that Theorem |1.3| is an analogue of 
the weighted theta prime number for a certain infinite graph. The connection will 



become more clear after we present a slightly more general version of Theorem 1.3 
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An L°° function /: R n -> C NxN is said to be of positive type if f(x) = f(-x)* 
for all x £ K" and for all L 1 functions p: M n — > C w we have 

P{y)*f(% ~ V)p{x) dxdy > 0. 

When N — 1 we have the classical theory of functions of positive type (see 
e.g. the book by Folland [16] for background). Many useful properties of such 
functions can be extended to the matrix-valued case (that is, to the N > 1 case) 
via a simple observation: a function /: W 1 — > <C NxN is of positive type if and only 
if for all p £ C N the function g p : M. n — > C such that 

g p (x) =p*f(x)p 

is of positive type. 

From this observation two useful classical characterizations of functions of posi- 
tive type can be extended to the matrix-valued case. The first one is useful when 
dealing with continuous functions of positive type. It states that a continuous 
and bounded function /: R" — > C NxN is of positive type if and only if for every 
choice Xxi ■ ■ ■ , x m of finitely many points in R n , the block matrix (/(xj — Xj)j t 
is positive semidefinite. 

The second characterization is given in terms of the Fourier transform. It states 
that an L 1 function /: W 1 —> C NxN is of positive type if and only if the ma- 
trix [fij(u)) i . —1 is positive semidefinite for all u £ W 1 . So in the statement of 
Theorem [T3J for instance, one could replace condition (i) by the equivalent condi- 
tion that / be a function of positive type. 

When N = 1, the previous two characterizations of functions of positive type 
date back to Bochner [6]. 

With this we may give an alternative and more general version of Theorem |1.3| 



Theorem 3.1. Let Jd, . . . , JC N be convex bodies in R n and let f: R n -> R NxN be 
a continuous and L 1 function. Suppose f satisfies the following conditions: 

(i) the matrix (/ij(0) — (vol/C i ) 1 / 2 (vol/C :) ) 1 / 2 )^ V _ 1 is positive semidefinite; 

(ii) / is of positive type; 

(iii) fij{x) < whenever K° n [x + JC°) = 0. 

Then the density of every packing of translates of K,\, . . . , /Cat in the Euclidean 
space W l is at most max{ fu{0) : i = 1, . . . , N}. 

Let V = R n X {1, . . . , N}. Notice that the kernel K : V x V -> R such that 

K{(x,i), (y,j)) = fij(x - y), 

implicitly defined by the function /, plays the same role as the matrix K from 



the definition of the theta prime number (cf. Section 2.2). For instance, this is a 
positive kernel, since / is of positive type and hence for any L 1 function p: V — > R 
we have that 

K((x,i), (y,j))p(x,i)p(y,j)d(x,i)d(y,j) > 0. 



v 



Theorem 3.1 can then be seen as an analogue of the weighted theta prime number 
for the packing graph with vertex set V that we consider. 

When one reads through the proof of Theorem [L3j the one step that fails when / 
is L 1 instead of Schwartz is the use of the Poisson summation formula. Indeed, 
sum ([9]) is not anymore well-defined in such a situation. The summation formula 
also holds, however, under somewhat different conditions that are just what we 
need to make the proof go through. The proof of the following lemma makes use of 
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the well-known interpretation of the Poisson summation formula as a trace formula, 
which for instance is explained by Terras [38l Chapter 1.3]. 

Lemma 3.2. Let f : ffi™ — > <C NxN be a continuous function of bounded support and 
positive type. Then for every lattice L C W 1 , every x £ R™, and all i, j — 1, . . . , N 
we have 

veL y ' ' u£L* 

Proof. Since each function /y is continuous and of bounded support, the func- 
tions gij : W 1 / L — > C such that 

veL 

are continuous. Indeed, the sum above is well-defined, being in fact a finite sum 
(since fij has bounded support), and therefore g^ can be seen locally as a sum of 
finitely many continuous functions. 

Let us now compute the Fourier transform of gij. For u £ L* we have that 

9ii(u) = I 9ij(x)e- 2niu - x dx 

JR"/L 



JR n /L r 



)e -2niu-x dx 



f ij (x)e- 2mu x dx 



y 

= fij(u) 

So we know that 

(12) ^W = ^7Z)E^H 

in the sense of L 2 convergence. Our goal is to prove that pointwise convergence 
also holds above. 

To this end we consider for % = 1, . . . , N the kernel K 4 : (R n /L) x (R n /L) -> C 
such that 

Ki{x,y) = ^2 fu(v +x-y). 
veL 

Since each function fa is of bounded support and continuous, each kernel Ki 
is continuous. Since for each i we have that fu(x) — fu(—x) for all x £ K n 
(since / is of positive type), each kernel Ki is self-adjoint. Notice that the func- 
tions x i — y (vol(R™ /L)) _1 / 2 e 27ri "' x , for u £ L* , form a complete orthonormal sys- 
tem of L 2 {R n /L). Each such function is also an eigcnfunction of Ki, with eigen- 
value fu(u). Indeed, we have 

Id{x, y)(vol(M"/L))- 1 / 2 e 2 ""^ dy 

= (vol(M'VL))- 1 / 2 / V f u {v + x- y)e 2 ™-y dy 
jRn / L veL 

= (v0l(M"/L))- 1 /2 f f u ( X - y) e ^-V dy 



(vol(M'VL))- 1 / 2 / fu(y)e 2 ™< x -yUy 
/ ll (u)(vol(E"/L))- 1 / 2 e 2 ""- a; . 
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Since / is of positive type, the matrices of Fourier transforms (/y (it)) j 
for u <E K n , are all positive semidefinite. In particular this implies that the Fourier 
transforms of fa , for i = 1, . . . , N, are nonnegative. So we see that each K{ is 
a continuous and positive kernel. Mercer's theorem (see for instance Courant and 
Hilbert [TU]) then implies that Ki is trace-class, its trace being the sum of all its 
eigenvalues. So for each i = 1, . . . , N, the series 

(13) E /«(**) 

u£L* 

converges, and since each summand is nonnegative, it converges absolutely. 

Suppose now that i, j = 1, . . . , N are so that i j. Since the matrices of Fourier 
transforms are nonnegative, for all u £ M. n we have that the matrix 

ffu{u) hj{u)\ 

is positive semidefinite, and this in turn implies that \fij(u)\ 2 < fa{u)fjj(u) for 



all u e W 1 . Using then the convergence of the series (13 1 and the Cauchy-Schwarz 
inequality, one gets 

E ^ E (/«m/«w) 1/2 < ( E /«(«)) V2 ( E ' 

and we see that in fact for alH, j = 1, . . . , N the series 



E /«(< 



uEL* 



converges absolutely. 



This convergence result shows that the sum in (12 1 converges absolutely and 



uniformly for all x G M. n /L. This means that the function defined by this sum is 



a continuous function, and since gij is also a continuous function, and in ( 12 ) we 
have convergence in the L 2 sense, we must also then have pointwise convergence, 
as we aimed to establish. □ 

With this we may give a proof of Theorem |3.1| 



Proof of Theorem \3.1\ Using Lemma |3.2[ we may repeat the proof of Theorem |1.3| 
given before, proving the theorem for continuous functions of bounded support. To 
extend the proof also to continuous L 1 functions we use the following trick. 

Let /: R™ — > M. NxN be a continuous and L 1 function satisfying the hypothesis 
of the theorem. For each T > consider the function g T : R n — > M. NxN defined 
such that 

T _vol(B(0,T)nB(x,T)) 
9 ( >~ vol 5(0, T) f[Xh 

where B(p,T) is the ball of radius T centered at p. 

It is easy to see that g T is a continuous function of bounded support. It is also 
clear that it satisfies condition (iii) from the statement of the theorem. We now 
show that g T is a function of positive type, that is, it satisfies condition (ii). 

For this pick any points xi, . . . , x m € R™. Let Xi '■ R™ ~~ ► {0: 1} be the character- 
istic function of B{xi, T) and denote by (4>, ip) the standard inner product between 
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functions <j> and ip in the Hilbert space L (R n ). Then 

vol(5(0,T) n B{x t - xj,T)) 
volB(0,T) 



^ _ ^ = m w^ ) n»&- Xi ,i ) > nxi _ xj) 



vol(B(xi,T) n B(xj,T)) 

vol «((). / :■ ' ' - 



f{%i Xjj 

ii-D^u, 1 ) 

(Xi,Xj) 



f {Xi Xj ) . 



volS(0,T)- 

This shows that the matrix {g T {xi — Xj)Yij—x i s positive semidefinite, being the 
Hadamard product, i.e. entrywise product, of two positive semidefinite matrices. 
We therefore have that g T is of positive type. 

Now, g T is a continuous function of positive type and bounded support, satisfying 
condition (iii). It is very possible, however, that g T does not satisfy condition (i), 
and so the conclusion of the theorem may not apply to g T . Let us now fix this 
problem. 

Notice that gfj converges pointwise to fij as T — » oo. Moreover, for all T > 
we have \gfj{x)\ < \fij{x)\. It then follows from Lebesgue's dominated convergence 
theorem that gfj(0) — > /y(0) as T — > oo. This means that there exists a num- 
ber To > such that for each T > To we may pick a number a(T) > 1 so that the 
function h T : R n -> C NxN such that 

hft(x) = a{T)gl(x) for i = 1, . . . , N, 

h%{x) = gfjix) for i, j = 1, . . . , N with % ^ j 

for all x G R™ satisfies condition (ii). We may moreover pick the numbers a(T) in 
such a way that limT-j-oo ct{T) = 1. 

It is also easy to see that each function h T is of positive type and bounded 
support and satisfies condition (iii). Hence the conclusion of the theorem applies 
for each h T , and so for every T > To we see that 

M T = max{ hf^O) : i = 1, . . . , N } 

is an upper bound for the density of any packing of translated copies of K.%, . . . , /Cat. 
But then, since g^(0) = fu{0) for all T > 0, and since limy-i-oo ot{T) = 1, we see 
that 

max{ fu(0) : i = 1, . . . , N } = lim M T , 

T— >oo 

finishing the proof. □ 



4. Computations for binary spherical cap packings 
In this and the next section we describe how we obtained the numerical results of 



Sections 1.2 and 1.3 Our approach is computational: to apply Theorems 1.2 and 1.3 
we use techniques from semidefinite programming and polynomial optimization. 

We start by briefly discussing the case of binary spherical cap packings. Next we 
will discuss the more computationally challenging case of binary sphere packings. 

It is a classical result of Lukacs (see e.g. Theorem 1.21.1 in Szego [37]) that a real 
univariate polynomial p of degree 2d is nonnegative on the interval [a, b] if and only 
if there are real polynomials q and r such that p(x) = (q(x)) 2 + {x — a){b—x)(r{x)) 2 . 
This characterization is useful when we combine it with the elementary but powerful 
observation (discovered independently by several authors, cf. Laurent [57]) that a 
real univariate polynomial p of degree 2d is a sum of squares of polynomials if and 
only if p(x) — v(x) 1 Qv{x) for some positive semidefinite matrix Q, where v{x) — 
(1, x, . . . , x d ) is a vector whose components are the monomial basis. 
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Let ai, ■ • ■ , ctN € (0, 7r] be angles and d be an integer. Write Vq(x) — (1, x, . . . , x d ) 
and vi(x) = (1, x, . . . , x^ 1 ). Using this characterization together with Theo- 
rem |1.2[ we see that the optimal value of the following optimization problem gives an 
upper bound for the density of a packing of spherical caps with angles u\, . . . , otjq. 



Problem A. For k = 0, 2d, find positive semidefinite matrices (jy.fc) i ._ v 
and for i, j = 1, . . . , N, find (d + 1) x (d + 1) positive semidefinite matrices Qij 
and d x d positive semidefinite matrices Rij that minimize 



2d 



/ J fu.k : i = I, ■ ■ ■ , N 

fe=0 



and are such that 



is positive semidefinite and the polynomial identities 

2d 

( 14 ) Ylh,kPk(u) + (Q i:i ,vo(u)vo(u) T ) 

+ (Rij, (u + l)(cos(ai + ay) — ii)t>i(M)?;i(it) T ) = 
arc satisfied for i, j = 1, . . . , N. < 
Above, (A, B) denotes the trace inner product between matrices A and B. Prob- 



lem A is a semidefinite programming problem, as the polynomial identities ( 14 ) can 
each be expressed as 2d+l linear constraints on the entries of the matrices involved. 
Indeed, to check that a polynomial is identically zero, it suffices to check that the 
coefficient of each monomial 1, x, . . . , x 2d is zero, and for each such monomial we 
get a linear constraint. 

In the above, we work with the standard monomial basis 1, x, . . . , x 2d , but we 
could use any other basis of the space of polynomials of degree at most 2d, both 



to define the vectors vq and ui and to check the polynomial identity (14). Such a 
change of basis does not change the problem from a formal point of view, but can 
drastically improve the performance of the solvers used. In our computations for 
binary spherical cap packings it was enough to use the standard monomial basis. 
We will see in the next section, when we present our computations for the Euclidean 
space, that a different choice of basis is essential. 

We reported in Section [L2"| on our calculations for N = 1, and 2 and n — 3, 4, 
and 5. The bounds, for the angles under consideration, do not seem to improve 
beyond d = 25, so we use this value for d in all computations. To obtain these 
bounds we used the solver SDPA-QD, which works with quadruple precision floating 
point numbers, from the SDPA family [T8] , 



5. Computations for binary sphere packings 

In this section we discuss our computational approach to find upper bounds for 
the density of binary sphere packings using Theorem |1.3| This is a more difficult 
application of semidefinite programming and polynomial optimization techniques 
than the one described in Section [4] 

It is often the case in applications of sum of squares techniques that, if one 
formulates the problems carelessly, high numerical instability invalidates the final 
results, or even numerical results cannot easily be obtained. This raises questions 
of how to improve the formulations used and the precision of the computations, so 
that we may provide rigorous bounds. We also address these questions and, since 
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the techniques we use and develop might be of interest to the reader who wants to 
perform computations in polynomial optimization, we include some details. 

5.1. Theorem |1.3| for multiple-size sphere packings. In the case of multiple- 
size sphere packings, Theorem |1.3| can be simplified. The key observation here 
is that, when all the bodies K,i are spheres, then condition (iii) depends only on 
the norm of the vector x. More specifically, if each Ki is a sphere of radius rj, 
then K\ n (x + Kfj = if and only if ||a;|| > n + rj . 

So in Theorem 11.31 one can choose to restrict oneself to radial functions. A 
function /: K™ — > C is radial if the value of f(x) depends only on the norm of x. 
If /: 1" — >• C is radial, for t > we denote by f(t) the common value of / for 
vectors of norm t. 

The Fourier transform f(u) of a radial function / also depends only on the norm 
of u; in other words, the Fourier transform of a radial function is also radial. By 
restricting ourselves to radial functions, we obtain the following version of Theo- 
rem [Ol 

Theorem 5.1. Let n, rjv > and let f : K™ — » M. NxN be a matrix-valued 
function whose every component is a radial Schwartz function. Suppose f sat- 
isfies the following conditions: 

(i) the matrix (fij(0) — (voli?(ri)) 1 / 2 (voli?(rj)) 1 / 2 )^._ 1 is positive semidefi- 
nite, where B(r) is the ball of radius r centered at the origin; 

(ii) the matrix of Fourier transforms is positive semidefinite for 
every t > 0; 

(iii) fij(w) < if w > n + r j7 for i, j = 1, N. 

Then the density of any packing of spheres of radii r\, . . . , r^ in the Euclidean 
space W l is at most max{ fu(0) : i = 1, . . . , N}, 

One might ask whether the restriction to radial functions worsens the bound of 
Theorem |1.3| For spheres, this is not the case. Indeed, suppose each body /Q is a 
sphere. If /: R™ — > R NxN is a function satisfying the conditions of the theorem, 
then its radialized version, the function 

70*0= / /(||s||0dco„(0. 

also satisfies the conditions of the theorem, and it provides the same upper bound. 
This shows in particular that, for the case of multiple-size sphere packings, Theo- 
rem [53] is equivalent to Theorem |1.3| 

5.2. A semidefinite programming formulation. To simplify notation and be- 
cause it is the case of our main interest we now take N = 2. Everything in the 
following also goes through for arbitrary N with obvious modifications. 

To find a function / satisfying the conditions of Theorem |5.1| we specify / via 
its Fourier transform. Let d > be an odd integer and consider the even func- 
tion ip: K>o -> M 2x2 such that 

d 

fc=0 

where each is a real number and a^.fe = for all k. We set the Fourier 
transform of / to be 

Notice that each is a Schwartz function, so its Fourier inverse is also Schwartz. 
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The reason why we choose this form for the Fourier transform of / is that it 
makes it simple to compute / from its Fourier transform by using the following 
result. 

Lemma 5.2. We have that 

(15) / \\ u fk e -^\\uf e 2 m u-x du = ^ 7r -* e -T|l»ll"£»/ 2 - 1 ( fl .|| a .||2) > 



where L^ 2 1 is the Laguerre polynomial of degree k with parameter n/2 — 1. 

For background on Laguerre polynomials, we refer the reader to the book by 
Andrews, Askey, and Roy pQ. 



Proof. With f(u) = \\u\\ 2k e-*" u W , the left hand side of is equal to f(-x). By 
[B Theorem 9.10.3] we have 

poo 

f(-x) = 2^|M| 1 -"/ 2 / ,s 2fc e— 2 J„ /2 _ 1 (2 7 r S || a; ||) S "/ 2 d S , 
Jo 

where J n /2-i is the Bessel function of the first kind with parameter n/2 — 1. Using 
[H Corollary 4.11.8] we see that this is equal to 

__*r(fc + n/2) | W|a / —k 
r(n/2) 1 X Vn/2' 

where \F\ is a hypergeometric series. 
By 1, (6.2.2)] we have 



(16) - k \^e 'INI lFl ( 2 M\ X f), 



Vn/2 / (n/2) fe 



2 \ 



where {n/2) k = (n/2)(l + n/2) ■ • • (fc - 1 + n/2). 

By substituting this in (16 1, and using the property that T(x + 1) = xT(x) for 
all a: / 0,-1, —2, . . ., we obtain the right hand side of ( 15 ) as desired. □ 



So we have 

d 



fij(x) = f ^(||n||) e --ll«ll 2 e 2 ^du = ^ a4J - fc fc!7r- fc e-^^ 2 L^ /2 -\7r||x|| 2 )^ 

^ RI1 k=0 

Notice that it becomes clear that /y is indeed real-valued, as required by the 
theorem. 

Consider the polynomial 

d 

P {t) = Y J ^t 2k . 

k=Q 



According to Lemma 5.2 if g(x) is the Fourier inverse of g(u) = p(||«||)e~ 7r " u " , 
then = g(||a;||)e — "'I 31 ' , where 

d 

q(w)=J2akk\ir- k L n k /2 - 1 (nw 2 ) 

is a univariate polynomial. We denote the polynomial q above by J" _1 [p]. Notice 
that J-" _1 [j>] is obtained from p via a linear transformation, i.e., its coefficients are 
linear combinations of the coefficients of p. With this notation we have 

^(x)=7--Vd(IMI)e- 7r|N|2 . 

Let 

2 d 

(17) a(t,y 1 ,y 2 ) = ^2 " .jj. ' 2 '" !J,!I , ■ 

i,j = l k=0 
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If this polynomial is a sum of squares, then it is nonnegative everywhere, and 
hence the matrices ((fij (i)J i are positive semidefinite for all t > 0. This implies 
that / satisfies condition (ii) of Theorem |5.1| (The converse is also true, that if 
the matrices ((/^(i))^ are positive semidefinite for all t > 0, then a is a sum of 
squares; For a proof see Choi, Lam, Reznick [7J. This fact is related to the Kalman- 
Yakubovich-Popov lemma in systems and control; see the discussion in Aylward, 
Itani, and Parrilo [2].) 

Moreover, we may recover ip, and hence /, from a. Indeed we have 
cp n (t) = a(t,l,0), 

(18) (p 22 (t) =a(t, 0,1), and 

<p 13 (t) = (l/2)(«r(t, 1, 1) - a(t, 1, 0) - a(t, 0, 1)). 

So we can express condition (i) of Theorem |5.1| in terms of a . We may also express 
condition (iii) in terms of a, since it can be translated as 

(19) J-^ 1 [ifij](w) < for all w > + i'j and i, j = 1, 2 with i < j. 



If we find a polynomial a of the form (|17| that is a sum of squares, is such that 



(20) (^(0) - (volBir^^olBir,))^) 



is positive semidefinite, and satisfies (19), then the density of a packing of spheres 
of radii r\ and r 2 is upper bounded by 

max{.F-Vi](0),^~>22](0)}- 

We may encode conditions ( |19[ ) in terms of sums of squares polynomials (cf. Sec- 
tion |4| , and therefore we may encode the problem of finding a a as above as a 
semidefinite programming problem, as we show now. 

Let Pq, Pi, . . . be a sequence of univariate polynomials where polynomial 
has degree k. Consider the vector of polynomials v, which has entries indexed 
by {0,...,Ld/2j} given by 

v(t) k = P k {t 2 ) 
for k = 0, . .., [d/2\. We also write V(t) = v(t)v(t) T . 

Consider also the vector of polynomials m with entries indexed by {1,2} x 
{0,...,[d/2\} given by 

m(t,y 1 ,y 2 ) it k = Pk{t 2 )Vi 
for i, j = 1,2 and k = 0, [d/2\ . 

Since a is an even polynomial, it is a sum of squares if and only if there are 
positive semidefinite matrices So, Si <E IR(' i + 1 ) x ( d + 1 ) such that 

e{t,y\,y2) = {Sa,m{t,y ll y 2 )m(t,y 1 ,y 2 ) J ) + (S 1 ,t 2 m(t,y 1 ,y 2 )m(t,y 1 ,y 2 ) T ) . 

From the matrices So and Si we may then recover ipij and also J 7-1 [ipij] . A more 
convenient way for expressing ip^ in terms of So and Si is as follows. Consider the 
matrices 

Y u = (l ° ), Y 22 =r Q °Y and Y 12 - l * 



Then 
and 



<Pij(t) = (So, V(t) ® Yij) + (S!,t 2 V{t) ® Yu) 



F-^ipijMw) = (S ,J- 1 [^(i)]H®^) + (S 1 ,J- 1 [t 2 y(i)]H<8>^) ! 
where J 7 ^ 1 , when applied to a matrix, is applied to each entry individually. 

With this, we may consider the following semidefinite programming problem for 
finding a polynomial a satisfying the conditions we need. 
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Problem B. Find (d + 1) X (d + 1) real positive semidefinite matrices So and Si, 
and (|_d/2j + 1) x (L^/2J + 1) real positive semidefinite matrices Qn, Q22, and Q i2 
that minimize 

max{(S , T-^Vit)]^) ® Y u ) + (S^J^MOKO) ® y u ), 

(s , j-M^WKo) ® y 22 > + (Sx,^ 1 ^^^)]^) ® f 22 )} 

and are such that 

(21) ((So, F(0) ® y y ) - (vol J B(r i )) 1 / 2 (vol( J B(r,)) 1 / 2 )J. =i 

is positive definite and the polynomial identities 

+ {Q l] ,{w 2 -{r l + r j ) 2 )V{w))={) 
arc satisfied for i, j = 1, 2 and i < j. < 



Any solution to this problem gives us a polynomial er of the shape ( 17) which is 



a sum of squares and satisfies conditions (19) and (20), and so the optimal value 
is an upper bound for the density of any packing of spheres of radius r x and r 2 . 
There might be, however, polynomials a satisfying these conditions that cannot 



be obtained as feasible solutions to Problem B, since condition (22) is potentially 



more restrictive than condition ( 19 1 (compare Problem B above with Lukacs' result 
mentioned in Section [4]) . In our practical computations this restriction was not 
problematic and we found very good functions. 

Observe also that Problem B is really a semidefinite programming problem. 



Indeed, the polynomial identities in ( 22 ) can each be represented as d + 1 linear 
constraints in the entries of the matrices Si and Qij. This is the case because 
testing whether a polynomial is identically zero is the same as testing whether each 
monomial has a zero coefficient and so, since all our polynomials are even and of 
degree 2d, we need only check if the coefficients of the monomials x 2k are zero 
for k = 0, . . . , d. 

5.3. Numerical results. When solving Problem B, we need to choose a sequence 
Po, Pi, ... of polynomials. A choice which works well in practice is 

where //& is the absolute value of the coefficient of L^ 2 ^ 1 (2nt) with largest absolute 
value. We observed in practice that the standard monomial basis performs poorly. 



To represent the polynomial identities in ( 22 ) as linear constraints we may check 
that each monomial x 2k of the resulting polynomial has coefficient zero. We may 
use, however, any basis of the space of even polynomials of degree at most 2d to 
represent such identities. Given such a basis, we expand each polynomial in it and 
check that the expansion has only zero coefficients. The basis we use to represent 
the identities is P (t 2 ), Pi(t 2 ), . . . , P d (t 2 ), which we observed to work much better 
than t°, t 2 , . . . , t 2d . Notice that no extra variables are necessary if we use a different 
basis to represent the identities. We need only keep, for each polynomial in the 
matrices J r ~ 1 [V(t)](w), J r ~ 1 {t 2 V(t)](w), w 2 V(w), and V(w), its expansion in the 
basis we want to use. 

The plot in Figure [3] was generated by solving Problem B with d = 31 using 
the solver SDPA-GMP from the SDPA family [18]. The input for the solver was 
generated by a SAGE [36 program working with floating-point arithmetic and 
precision of 256 bits. For each dimension 2, . . . , 5 we solved Problem B with r\ = 
r/1000 and r 2 = 1 for r = 200, . . . , 1000; the reason we start with r = 200 is that 
for smaller values of r the solver runs into numerical stability problems. We also 
note that the solver has failed to solve some of the problems, and these points have 
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been disconsidered when generating the plot. The number of problems that could 
not be solved was small though: for n = 2 all problems could be solved, for n = 3 
there were 6 failures, for n = 4 we had 18 failures, and finally for n = 5 the solver 
failed for 137 problems. 

With our methods we can achieve higher values for d, but we noticed that 
the bound does not improve much after d = 31. For instance, in dimension 2 
for r\ — 1/2 and — 1, we obtain the bound 0.9174466 . . . for d = 31 and the 
bound 0.9174426 ... for d = 51. 



* * * 



In the previous account of how the plot in Figure [3] was generated, we swept under 
the rug all precision issues. We generate the data for the solver using floating-point 
arithmetic, and the solver also uses floating-point arithmetic. We cannot therefore 
be sure that the optimal value found by the solver gives a valid bound at all. 

If we knew a priori that Problem B is strictly feasible (that is, that it admits 
a solution in which the matrices Si and Qij are positive definite), and if we had 
some control over the dual solutions, then we could use semidefinite programming 
duality to argue that the bounds we compute are rigorous; see for instance Gijswijt 
[T71 Chapter 7.2] for an application of this approach in coding theory. The matter 
is however that we do not know that Problem B is strictly feasible, neither do we 
have knowledge about the dual solutions. In fact, most of our approach to provide 
rigorous bounds consists in finding a strictly feasible solution. 

A naive idea to turn the bound returned by the solver into a rigorous bound 
would be to simply project a solution returned by the solver onto the subspace 



given by the constraints in (22 1. If the original solution is of good quality, then this 
would yield a feasible solution. 

There are two problems with this approach, though. The first problem is that 
the matrices returned by the solver will have eigenvalues too close to zero, and 
therefore after the projection they might not be positive semidefinite anymore. We 
discuss how to handle this issue below. 

The second problem is that to obtain a rigorous bound one would need to per- 
form the projection using symbolic computations and rational arithmetic, and the 
computational cost is just too big. For instance, we failed to do so even for d = 7. 

Our approach avoids projecting the solution using symbolic computations. Here 
is an outline of our method. 

(1) Obtain a solution to the problem with objective value close the optimal 
value returned by the solver, but in which every matrix Si and Qij is positive 
definite by a good margin and the maximum violation of the constraints is 
very small. 

(2) Approximate matrices Si and Qij by rational positive semidefinite matri- 
ces Si and Qij having minimum eigenvalues at least A; and fiij , respectively. 



(3) Compute a bound on how much constraints ( 22 ) are violated by Si and Qij 
using rational arithmetic. If the maximum violation of the constraints is 
small compared to the bounds A^ and [iij on the minimum eigenvalues, then 
we may be sure that the solution can be changed into a feasible solution 
without changing its objective value too much. 

We now explain how each step above can be accomplished. 

First, most likely the matrices Si, Qij returned by the solver will have eigenvalues 
very close to zero, or even slightly negative due to the numerical method which 
might allow infeasible steps. 

To obtain a solution with positive definite matrices we may use the following 
trick (cf. Lofberg [3U]). We solve Problem B to find its optimal value, say z* . Then 
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we solve a feasibility version of Problem B in which the objective function is absent, 
but we add a constraint to ensure that 

mzx{{S ,F- 1 [V(t)}(0)®Y 11 ) + (Si,F- 1 [t 2 V(t)}(0)®Y n ), 

(5 , J- x [y(i)](0) ® Y 22 ) + (S u F- 1 [t 2 V(t)](0)®Y 22 )} <z*+ v , 

where r\ > should be small enough so that we do not jeopardize the objective value 
of the solution, but not too small so that a good strictly feasible solution exists. 
(We take i] = 10~ 5 , which works well for the purpose of making a plot.) The 
trick here is that most semidefinite programming solvers, when solving a feasibility 
problem, will return a strictly feasible solution — the analytical center — , if one 
can be found. 

This partially addresses step (1), because though the solution we find will be 
strictly feasible, it might violate the constraints too much. To quickly obtain a 
solution that violates the constraints only slightly, we may project our original 



solution onto the subspace given by constraints ( 22 ) using floating-point arithmetic 
of high enough precision. If the solution returned by the solver had good precision 
to begin with, then the projected solution will still be strictly feasible. 

As an example, for our problems with d = 31, SDPA-GMP returns solutions that 
violate the constraints by at most 10 -30 . By doing a projection using floating-point 
arithmetic with 256 bits of precision in SAGE, we can bring the violation down to 
about 10~ 70 without affecting much the eigenvalues of the matrices. 

So we have addressed step (1). For step (2) we observe that simply converting 
the floating-point matrices Si, Qij to rational matrices would work, but then we 
would be in trouble to estimate the minimum eigenvalues of the resulting rational 
matrices in a rigorous way. Another idea of how to make the conversion is as follows. 

Say we want to approximate floating-point matrix A by a rational matrix A. 
We start by computing numerically an approximation to the least eigenvalue of A. 
Say A is this approximation. We then use binary search in the interval [A/2, A] to 
find the largest A so that the matrix A — XI has a Cholesky decomposition; this 
we do using floating-point arithmetic of high enough precision. If we have this 
largest A, then 

A = LL T + XI 

where L is the Cholesky factor of A — XI. Then we approximate L by a rational 
matrix L and we set 

A = IU + XI, 

obtaining thus a rational approximation of A and a bound on its minimum eigen- 
value. 



Our idea for step (3) is to compare the maximum violation of constraints (22) 
with the minimum eigenvalues of the matrices. To formalize this idea, suppose that 
constraints |22| are slightly violated by Si, Qij. So for instance we have 



(So, J-^WIM ® *ii) + (S U T- l \t 2 V{t)]{w) ® Yn) 

+ (Q 11 ,(w 2 -(2r 1 ) 2 )V(w)) = p, 



where p is an even polynomial of degree at most 2d. Notice that we may compute an 
upper bound on the absolute values of the coefficients of p using rational arithmetic. 

To fix this constraint we may distribute the coefficients of p in the matrices So 
and Qn (a very similar idea was presented by Lofberg [29] ) . To make things precise, 
for k = 1, . . . , d write 

i(k) = mva{[d/2\,k- 1}, 
j(k) =k-l- i(k). 
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Pairs (i(k),j(k)) correspond to entries of the matrix V(w). Notice that the poly- 
nomial (w 2 — (2ri) 2 )V (w)i(k)j(k) has degree 2k. 
So the polynomials 

R Q =T- 1 [V(t) oa }(w), 

R 1 = (w 2 ~(2r 1 ) 2 )V(w) l(imi)l 



Rd = (w 2 - {2r 1 ) 2 )V(w)i( d ) j(d) 

form a basis of the space of even polynomials of degree at most 2d. We may then 
express our polynomial p in this basis as 

p = a R H + a d R d . 

Now, we subtract ap from (/5o)(i,o),(i,o) an d cx-k from for A = 1, . . . , d. 



The resulting matrices satisfy constraint (23), and as long as the are small 
enough, they should remain positive semidefinite. More precisely, it suffices to 
require that d \\(a>i, . . . , otd)\\oa < Mil an d |a | < A . 

There are two issues to note in our approach. The first one is that it has to be 



applied again twice to fix the other two constraints in ( 22 ) . The applications do not 
conflict with each other: in each one we change a different matrix Qij and different 
entries of Sq. We have to be careful though that we consider the changes to Sq at 
once in order to check that it remains positive semidefinite. 

The second issue is how to compute the coefficients at ■ Computing them explic- 
itly using symbolic computation is infeasible. One way to do it then is to consider 
the basis change matrix between the bases x 2k , for k = 0, . . . , d, and Ro, . . . , R d , 
which we denote by U. Then we know that 

\\(a , . . . 

iOtd)\\oc — \\U ||oo||P||oo> 

where ||p|| oo is the oo-norm of the vector of coefficients of p in the basis x 2k . 

So if we have an upper bound for 1 1 1 1 1 oo we are done. To quickly find such 
an upper bound, we use an algorithm of Higham |24j (cf. also Higham |23j ) which 
works for triangular matrices, like U. This bound proved to be good enough for 
our purposes. 

6. Improving sphere packing bounds 



nJO| i 



We now prove Theorem |1.4| and show how to use it in order to compute the 
bounds presented in Table [ 



Proof of Theorem \1.4\ Let x%, . . . , xn £ 1" and L C M. n be a lattice such that 

N 

\J\Jv + Xi + B 

v£L i=l 

is a sphere packing, where B is the ball of radius 1/2 centered at the origin. We may 
assume that, if i ^ j and u^O, then the distance between the centers of v + Xi + B 
and Xj + B is greater than 1 + e m . Indeed, we could discard all Xi that lie at 
distance less than 1 + e m from the boundary of the fundamental parallelotope of L. 
If the fundamental parallelotope is big enough (and if it is not, we may consider a 
dilated version of L instead), this will only slightly alter the density of the packing, 
and the resulting packing will have the desired property. 
Consider the sum 

N 

(24) EE/(» + K-ij). 

— 1 vGL 
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Using the Poisson summation formula, we may rewrite it as 

1 N 
vol(M«/L) ^ ^ I( ' 

v ' ' i,j=l u^L* 

By discarding all summands in the inner sum above except the one for u = 0, 
we see that (24) is at least 

iV 2 volB 
vol(M™/L)' 

For fc = 1, . . . , m, write F k = { : \\n - Xj\\ G [1 + e k -i, 1 + e k ) }■ Then we 
see that ( 24 ) is at most 

Nf(0) + r h \F 1 \+--- + r lm \F m \. 



So we see that 



< /(o) + m + ■ ■ ■ + Vn 



vol(R"/L) - J w ' JV ' N ' 

Notice that the left-hand side above is exactly the density of our packing. Now, 
from the definition of M(e), it is clear that for k — 1, . . . , m we have 

^ + --- + ^<M( £fc ), 
N N ~ V h 

and the theorem follows. □ 



To find good functions / satisfying the conditions required by Theorem |1.4| we 
used the same approach from Section [5] We fix an odd positive integer d and 
specify / via its Fourier transform, writing 

d 

(pit) =J2 a k t2k 
fe=0 

and setting 

/» = ^(H|)e-ll< 
Using Lemma |5.2| we then have that 

/(x)=7- 1 M(||x||)e-INI 2 , 

where 

d 

fc=0 

is a polynomial obtained as a linear transformation of ip. 

Constraint (ii), requiring that f(u) > for all u G K™, can be equivalently 
expressed as requiring that the polynomial ip should be a sum of squares. 

Recalling the result of Lukacs mentioned in Section |4j one may also express 
constraint (iii) in terms of sums of squares: one simply has to require that there 
exist polynomials po(w) and qo(w) such that 

^MM = -boM) 2 - (w 2 - (1 + s m f)(q (w)f. 

In a similar way, one may express constraints (iv). For instance, for a given k, 
we require that there should exist polynomials p k (w) and q k (w) such that 

F- l [<p]{w)e-< 1+s *-^ - m = ~( Pk (w)) 2 -(w- (1 +e fc _i))((l +e k ) - w)(q k (w)) 2 , 

and this implies (iv). 

So we may represent the constraints on / in terms of sums of squares, and there- 
fore also in terms of semidefinite programming, as we did in Sections [4] and [5j There 
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is only the issue that now we want to find a function / that satisfies constraints (i)- 
(iv) of the theorem and that minimizes the maximum in ^ . This does not look like 
a linear objective function, but since by linear programming duality this maximum 
is equal to 

min f{0) + yiU(e 1 ) + ---+y m U(e m ) 

y%-\ h Vm > Vi for % = 1, . . . , m, 

y k > for k = 1, . . . , m, 

we may transform our original problem into a single minimization semidefinite 
programming problem, the optimal value of which provides an upper bound for the 
densities of sphere packings. 

It is still a question how to compute upper bounds for M(e). For this we use 
upper bounds on the sizes of spherical codes. A spherical code with minimum 
angular distance < 8 < tt is a set C C S'™^ 1 such that the angle between any two 
distinct points in C is at least 8. In other words, a spherical code with minimum 
angular distance 8 gives as packing of spherical caps with angle 9/2. We denote 
by A(n,8) the maximum cardinality of any spherical code in S"™" 1 with minimum 
angular distance 8. 

There is a simple relation between A(n,8) and M(e). Namely, if e < v2 — 1, 
then 

M(e) < A(n, arccosi(e)), 

where 

1 

2(l + £) 

To see this, suppose x, y £ W 1 are such that ||x||, \\y\\ £ [1, and \\x — y\\ > 1. 
Then by the law of cosines, if is the angle between x and y, we have 

COS (7 — 

2|klllbll 

The maximum of the right-hand side above for vectors x and y such that ||a;||, 
Hj/H £ [1,1 + e) and ||a; — y\\ > 1 gives 

Indeed, to maximize the right-hand side above, we may assume that — y\\ = 1. 
Then 

where a = ||x|| and /3 = ||y||. 

If we compute the derivative of 0(a, /3) with respect to a we obtain 

a 2 -/3 2 + l 
2a 2 f3 ' 

From this we see that, since e < V2 — 1, for a fixed /3 € [1,1+ e), function 6(a, /3) 
is increasing in a, for a > 1. Similarly, by taking the derivative with respect to /?, 
one may conclude that for a fixed a £ [1, 1 + e), function 6 (a, /3) is increasing in 0, 
for p > 1. So 0(a, /3) is maximized in our domain when one takes a = j3 = 1 + s. 
This implies that 

co S g<e(i + £ ,i + £ ) = i- 2(1 | e)2 , 

and so we have t(e). 

For the bounds of Table[l]we took d = 31. To compute upper bounds for A(n, #) 
we used the semidefinite programming bound of Bachoc and Vallcntin [4 . The 
bounds we used for computing Table [T] are given in Table [2j 

Finally, we mention that all numerical issues discussed in Section [5] also happen 
with the approach we sketched in this section. In particular, the choices of bases are 
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important for the stability of the semidefinite programming problems involved. We 
use the same bases as described in Section[5]though, so we skip a detailed discussion 
here. Notice moreover that our bounds are rigorous, having been checked with the 
same approach described in Section [5j 



14), (0.113864, 15) 

26), (0.036951, 27) 

47), (0.017446, 48) 

81), (0.008097, 82) 



Dimension (e, U(e)) pairs 

3 (0.022753, 12), (0.054092, 13), (0.082109, 

4 (0.008097, 24), (0.017446, 25), (0.025978, 

5 (0.003013, 45), (0.008097, 46), (0.013259, 

6 (0.002006, 79), (0.004024, 80), (0.006054, 

7 (0.001001, 136), (0.002006, 137), (0.003013, 138), (0.004024, 139), 
(0.005037, 140) 

9 (0.003013,373), (0.029233,457), (0.030325,459), (0.031421,464), 

(0.032520,468), (0.033622,473) 

Table 2. For each dimension considered in Table Q] we show here 
the sequence £!<•••< e m and the upper bounds U(sk) used in 



our application of Theorem 1.4 



We refrained from performing similar calculations for higher dimensions because 
of two reasons. Firstly, we expect that the improvements are only minor. Secondly, 
the computations of the upper bounds for M(s) in higher dimensions require sub- 
stantially more time as one needs to solve the semidefinite programs with a high 
accuracy solver, see Mittelmann and Vallentin [32) . 
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